MPA 5830 - Task 04 Solutions

Ani Ruhil

2018-08-02


(1)

Read in the data file sent to you. It was downloaded from Kaggle and comes with the following information:

Mobile App Statistics (Apple iOS app store): The ever-changing mobile landscape is a challenging space to navigate. The percentage of mobile over desktop is only increasing. Android holds about 53.2% of the smartphone market, while iOS holds 43%. To get more people to download your app, you need to make sure they can easily find it. Mobile app analytics is a great way to understand the existing strategy and to drive growth and retention of future users.

With millions of apps around nowadays, the following data set has become key to identifying the top trending apps in the iOS app store. It contains details on more than 7000 Apple iOS mobile applications. The data was extracted from the iTunes Search API at the Apple Inc. website, using R and Linux web scraping tools.

Dimensions of the data set: 7197 rows and 16 columns

The contents include:

  1. “id”: App ID
  2. “track_name”: App name
  3. “size_bytes”: Size (in bytes)
  4. “currency”: Currency type
  5. “price”: Price amount
  6. “rating_count_tot”: User rating count (for all versions)
  7. “rating_count_ver”: User rating count (for the current version)
  8. “user_rating”: Average user rating (for all versions)
  9. “user_rating_ver”: Average user rating (for the current version)
  10. “ver”: Latest version code
  11. “cont_rating”: Content rating
  12. “prime_genre”: Primary genre
  13. “sup_devices.num”: Number of supported devices
  14. “ipadSc_urls.num”: Number of screenshots shown for display
  15. “lang.num”: Number of supported languages
  16. “vpp_lic”: VPP device-based licensing enabled

There may be an errant column that gets read in, likely named X; feel free to drop it by setting it to NULL.
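
The R code for this step is not reproduced in this write-up. As a rough sketch of the same idea in Python (standard library only), the block below reads a CSV and discards a stray index column. The three sample rows, the app names, and the helper name `read_apps` are hypothetical stand-ins, not the Kaggle file or any particular API; the assumption is that the errant column has either an empty header or the header `X`.

```python
import csv
import io

# Hypothetical three-row sample standing in for the Kaggle file. The unnamed
# first column is the kind of stray row index that R would label "X".
SAMPLE_CSV = """,id,track_name,price,prime_genre
1,1001,App A,0.00,Games
2,1002,App B,2.99,Education
3,1003,App C,0.00,Games
"""


def read_apps(handle, errant=("", "X")):
    """Read CSV rows as dicts, dropping any column whose header is in `errant`."""
    reader = csv.DictReader(handle)
    return [
        {name: value for name, value in row.items() if name not in errant}
        for row in reader
    ]


apps = read_apps(io.StringIO(SAMPLE_CSV))
print(len(apps), "rows; columns:", list(apps[0]))
```

For the real file, one would pass an open file handle instead of the `io.StringIO` sample.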

(3)

Draw an appropriate plot that shows the distribution of apps’ primary genre. Make sure you title the plot, label the x-axis and y-axis. What three primary genres have the most apps? What primary genre has the fewest apps?

Games, Entertainment, and Education have the most apps, while Catalogs, Medical, and Navigation appear to have the fewest apps.
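
The ranking behind an answer like this comes from a frequency table of `prime_genre`. A minimal Python sketch of that tabulation follows, with a text bar chart standing in for the plot. The ten genre labels are hypothetical illustration data, not the actual counts from the 7197-row file.

```python
from collections import Counter

# Hypothetical genre labels for illustration only; the real prime_genre
# column has one entry per app in the data set.
genres = [
    "Games", "Games", "Games", "Games",
    "Entertainment", "Entertainment", "Entertainment",
    "Education", "Education",
    "Catalogs",
]

genre_counts = Counter(genres)
ranked = genre_counts.most_common()          # (genre, count), largest count first
top_three = [genre for genre, _ in ranked[:3]]
fewest = ranked[-1][0]                       # last entry has the smallest count

# Text bar chart: title, then one labelled bar per genre.
print("Number of apps by primary genre (sample data)")
for genre, n in ranked:
    print(f"{genre:<15}{'#' * n} {n}")
print("Most apps:", top_three, "| Fewest:", fewest)
```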

(6)

Load the EPA data used for Task 03. Create a data set of the number of cars of each make by year. Then restrict this data set to only include the following makes: “Ford”, “General Motors”, “GMC”, “Chevrolet”, “Dodge”, “Pontiac”, “Honda”, “Mazda”, “Toyota”, “Subaru”, “BMW”, “Mercedes-Benz”. Now draw an appropriate plot that shows the number of cars per make and year. Use colors to distinguish between makes.
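
The two steps described here (count cars per make and year, then keep only the listed makes) can be sketched in Python as below. This is an illustration rather than the R solution: the `(make, year)` records are hypothetical, and the names `KEEP`, `make_year_counts`, and `subset` are assumptions introduced for the example.

```python
from collections import Counter

# The makes listed in the task.
KEEP = {
    "Ford", "General Motors", "GMC", "Chevrolet", "Dodge", "Pontiac",
    "Honda", "Mazda", "Toyota", "Subaru", "BMW", "Mercedes-Benz",
}

# Hypothetical (make, year) records standing in for rows of the EPA data.
cars = [
    ("Ford", 2016), ("Ford", 2016), ("Ford", 2017),
    ("Honda", 2016), ("Kia", 2016),
    ("Toyota", 2017), ("Toyota", 2017),
]

# Step 1: number of cars for every (make, year) pair.
make_year_counts = Counter(cars)

# Step 2: restrict the counted table to the makes of interest.
subset = {key: n for key, n in make_year_counts.items() if key[0] in KEEP}

for (make, year), n in sorted(subset.items()):
    print(make, year, n)
```

Filtering after counting, as here, gives the same table as filtering the raw rows first and then counting, because the filter depends only on the make.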

You could also do …

If you only filter(), you would have

(7)

Use an appropriate plot to explore the relationship between highway08 and city08 miles per gallon by year. Are the two related positively or negatively? Does the relationship appear to be weak or strong?

The two appear to be positively correlated and the relationship appears to be a strong one, especially so for recent years.
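
One way to put a number on "positive" and "strong" is a Pearson correlation computed separately for each year. The Python sketch below shows that calculation under stated assumptions: the `(year, city08, highway08)` records are hypothetical values, not EPA figures, and `pearson` is a helper written for this example.

```python
from collections import defaultdict
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length numeric sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)


# Hypothetical (year, city08, highway08) records for illustration only.
records = [
    (1990, 15, 20), (1990, 18, 26), (1990, 22, 27), (1990, 25, 35),
    (2015, 17, 24), (2015, 21, 29), (2015, 25, 34), (2015, 30, 40),
]

# Group the city and highway values by year.
by_year = defaultdict(lambda: ([], []))
for year, city, highway in records:
    by_year[year][0].append(city)
    by_year[year][1].append(highway)

# One correlation coefficient per year.
r_by_year = {year: pearson(city, hwy) for year, (city, hwy) in by_year.items()}
for year, r in sorted(r_by_year.items()):
    print(year, round(r, 3))
```

A coefficient near +1 in a given year would correspond to the tight, upward-sloping scatter described above.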